Author Details

Neural Machine Translation systems produce state-of-art translation for high resource languages. It is yet a challenge in low-resource and morphologically rich languages. In this paper, we have discussed the existing techniques in handling the morphologically rich and low-resource languages and presented our experiments on developing English-Malayalam NMT system where we have processed the data using different techniques namely word segmentation using morphological analyser and applying Byte pair Encoding (BPE) technique. The results show a significant improvement by implementing the word segmentation using morphological analyser.

Keywords

Neural Machine Translation, Morphologically rich languages, Morph segmentation, Byte Pair Encoding.

Full Text

References

Bahdanau D., Cho K., and Bengio Y (2014) Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473

Banerjee, A.,Jain A., Mhaskar S., Deoghare S,D. Sehgal A., and Bhattacharya, P. (2021). Neural Machine Translation in Low-Resource Setting: a Case Study in English-Marathi Pair. In Proceedings of the 18th Biennial Machine Translation Summit - Volume 1: Research Track, MTSummit 2021 Virtual, pp 35-47

Cho, K., van Merrienboer, B., Gulcehre, C., Bougares, F., Schwenk, H., and Bengio, Y. (2014).Learning phrase representations using RNN encoder-decoder for statistical machine translation. In Proceedings of the Empiricial Methods in Natural Language Processing (EMNLP 2014).

Dewangan, S., Alva, S., Joshi, N., Bhattacharyya, P. (2021). Experience of neural machine translation between Indian languages. Machine Translation 35, 71–99

Dominik Macháček, Jonáš Vidra, Ondřej Bojar (2018): Morphological and LanguageAgnostic Word Segmentation for NMT. In: Proceedings of the 21st International Conference on Text, Speech and Dialogue—TSD 2018, pp. 277-284, Springer-Verlag, Cham, Switzerland, ISBN 978-3-030-00794-2

Goyal, Vikrant and Kumar, Sourav and Sharma, Dipti Misra. (2020). Efficient Neural Machine Translation for Low-Resource Languages via Exploiting Related Languages. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, pp 162-168

Hema Ala, Vandan Mujadia, Dipti Misra Sharma. (2021). Domain Adaptation for HindiTelugu Machine Translation Using Domain Specific Back Translation. In Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2021), pp 26-34

Kalchbrenner, N. and Blunsom, P. (2013). Recurrent continuous translation models. In Proceedings of the ACL Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 1700–1709. Association for Computational Linguistics.

Kim, Y., Petrov, P., Petrushkov, P., Khadivi, S., and Ney, H. (2019). Pivot-based transfer learning for neural machine translation between non-English languages. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Proˇcessing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pages 866– 876, Hong Kong, China. Association for Computaˇtional Linguistics

Klein G., Hernandez F., Nguyen V., and Senellart J. (2020) The opennmt neural machine translation toolkit: 2020 edition. In Proceedings of the 14th Conference of the Association for Machine Translation in the Americas (AMTA 2020), pages 102–109.

Koneru, Sai; Liu, Danni; Niehues, Jan. (2021). Unsupervised Machine Translation On Dravidian Languages, In 16th conference of the European Chapter of the Association for Computational Linguistics (EACL), Proceedings of the First Workshop on Speech and Language Technologies for Dravidian Languages.

Lakshmi S., and Sobha Lalitha Devi (2013).”Malayalam Morphological Analyser”, In processings of International Seminar on Current Trends in Dravidian Linguistics, May 27-29, 2013

Laskar SR., Paul B., Adhikary PK, Pakray P., Bandyopadhyay S. (2021), Neural Machine Translation for Tamil–Telugu Pair. In Proceedings of the Sixth Conference on Machine Translation (WMT), pages 284–287

Luong M., Pham H., and Manning D. (2015). Effective approaches to attention-based neural machine translation. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pages 1412–1421.

Mujadia V. and Dipti Sharma. (2020) NMT based Similar Language Translation for Hindi - Marathi. In Proceedings of the Fifth Conference on Machine Translation, pages 414–417, Online. Association for Computational Linguistics.

Papineni K., Roukos S., Ward T., and Zhu W (2002) Bleu: a method for automatic evaluation of machine translation. In Proceedings of the 40th annual meeting of the Association for Computational Linguistics, pages 311–318.

Ranathunga, Surangika, En-Shiun Annie Lee, Marjana Prifti Skenduli, Ravi Shekhar, Mehreen Alam, and Rishemjit Kaur. 2021. Neural machine translation for low-resource languages: A survey. CoRR, abs/2106.15115.

Saldanha R., Ananthanarayana V. S and Anand Kumar M and Parameswari K. (2021) NITK-UoH: Tamil-Telugu Machine Translation Systems for the WMT21 Similar Language Translation Task. In Proceedings of the Sixth Conference on Machine Translation (WMT), pages 299–303

Sennrich R., Haddow B., and Birch A. (2016) Neural machine translation of rare words with subword units. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 1715–1725.

Sennrich, R., Haddow, B., and Birch, A. (2016). Improving neural machine translation models with monolingual data. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 86–96, Berlin, Germany. Association for Computational Linguistics.

Sutskever, I., Vinyals, O., and Le, Q. (2014). Sequence to sequence learning with neural networks. In Advances in Neural Information Processing Systems (NIPS 2014)

Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, U.; Polosukhin, I. (2017) Attention is All You Need. In Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, CA, USA, 4–9

Zhao, Y., Y. Wang, J. Zhang, and C. Zong (2018). Phrase table as recommendation memory for neural machine translation. In Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, IJCAI 2018, July 13-19, 2018, Stockholm, Sweden., pp. 4609–4615.

Event Extraction from social media Text in Malayalam using Neural Conditional Random Fields

Abstract Views :74 | PDF Views:0

Authors

Pattabhi RK Rao ¹, Sobha Lalitha Devi ¹

Affiliations
1 AU-KBC Research Centre, MIT Campus of Anna University, Chennai, India., IN

Source

Research Cell: An International Journal of Engineering Sciences, Vol 35 (2023), Pagination: 01-07

Abstract

This paper describes a Neural Conditional Random Fields (NCRF) approach for Event extraction (EE) task which aims to discover different types of events along with the event arguments from the user generated text content (tweets) in Malayalam. The data for this work was obtained from FIRE (Forum for Information Retrieval and Evaluation) 2017 shared task [12] on Event Extraction from Newswires and Social Media Text in Indian Languages. A NCRF is a combination of Recurrent Neural Network (RNN) and Conditional Random Fields (CRF). In addition to event detection, the system also extracts the event arguments which contain the information related to the events such as when (Time), where (Place), Reason, Casualty, Aftereffect etc). Our proposed Event Extraction system achieves F-score of 79.74%. The results are encouraging and comparable with the state-of-art.

Keywords

Event Extraction, Social Media Text, Indian Languages, Malayalam, Neural Conditional Random Fields (NCRF).

Full Text

References

Banko M, Cafarella MJ, Soderland S. (2007). Open information extraction for the web. IJCAI 2007; 7:2670–2676.

Collobert R, Weston J, Bottou L,. (2011) Natural language processing (almost) from scratch. The Journal of Machine Learning Research 2011; 12:2493–2537

Mark Dredze, Tim Oates, and Christine Piatko, (2010). “We’re not in Kansas anymore: detecting domain changes in streams”. In: Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pp 585–595. Association for Computational Linguistics (ACL).

Erhan D, Bengio Y, Courville A. (2010). Why does unsupervised pre-training help deep learning? The Journal of Machine Learning Research 2010; 11:625–660 [6] Dr. Moh. Osama K., “HELLO Flood Counter Measure for Wireless Sensor Network,” International Journal of Computer Science and Security, vol. 2 issue 3, 2007, pp-57-64.

Hege Fromreide, Dirk Hovy, and Anders Søgaard, (2014). “Crowdsourcing and annotating NER for twitter#drift”. European language resources distribution agency

Hinton G, Osindero S, Teh Y-W. (2006). A fast learning algorithm for deep belief nets. Neural computation 2006; 18:1527–1554

Krizhevsky A, Sutskever I, Hinton GE. (2012). Imagenet classification with deep convolutional neural networks. Advances in neural information processing systems 2012; 1097– 1105

John Lafferty, Andrew McCallum and Fernando Pereira. 2001. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proc. 18th International Conference on Machine Learning, Morgan Kaufmann, San Francisco, USA.pp.282-289

Lamblin P, Bengio Y. (2010). Important gains from supervised fine-tuning of deep architectures on large labeled sets. NIPS 2010 Deep Learning and Unsupervised Feature Learning Workshop 2010

Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. (2013). Efficient Estimation of Word Representations in Vector Space. In Proceedings of Workshop at ICLR.

Pattabhi R K Rao T, Vijay Sundar Ram R, Vijayakrishna R and Sobha L. (2007). 'A Text Chunker and Hybrid POS Tagger for Indian Languages'. In the Proceedings of IJCAI Workshop on Shallow Parsing for South Asian Languages, Hyderabad. pp. 9-12.

Pattabhi RK Rao and Sobha Lalitha Devi. (2017). 'EventXtract-IL: Event Extraction from Newswires and Social Media Text in Indian Languages@ FIRE 2017 - An Overview', In the Forum for Information Retrieval and Evaluation-2017.

Salakhutdinov R, Mnih A, Hinton G. (2007). Restricted Boltzmann Machines for Collaborative Filtering. Proceedings of the 24th International Conference on Machine Learning 2007; 791–798

Socher R, Lin CC, Manning C. (2011) Parsing natural scenes and natural language with recursive neural networks. Proceedings of the 28th international conference on machine learning (ICML-11) 2011; 129–136

Tang B, Wu Y, Jiang M. (2013) Recognizing and Encoding Disorder Concepts in Clinical Text using Machine Learning and Vector Space Model. Working Notes for CLEF 2013 Conference 2013; 1179

Jie Yang and Yue Zhang. (2018). NCRF++: An Open-source Neural Sequence Labeling Toolkit. In: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics-System Demonstrations, pages 74–79 Melbourne, Australia, July 15 - 20, 2018

Uzuner è„°zlem, South BR, Shen S. (2011) 2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text. Journal of the American Medical Informatics Association 2011; 18:552–556.

Username
Password
Remember me

Informatics Publishing Limited

Author Details

Devi, Sobha Lalitha

Neural Machine Translation for English-Malayalam

Authors

Source

Abstract

Keywords

Full Text

References

Event Extraction from social media Text in Malayalam using Neural Conditional Random Fields

Authors

Source

Abstract

Keywords

Full Text

References